Arabian Gulf
Seek and You Shall Fold
Sellam, Nadav Bojan, Bojan, Meital, Schanda, Paul, Bronstein, Alex
Accurate protein structures are essential for understanding biological function, yet incorporating experimental data into protein generative models remains a major challenge. Most predictors of experimental observables are non-differentiable, making them incompatible with gradient-based conditional sampling. This is especially limiting in nuclear magnetic resonance, where rich data such as chemical shifts are hard to directly integrate into generative modeling. We introduce a framework for non-differentiable guidance of protein generative models, coupling a continuous diffusion-based generator with any black-box objective via a tailored genetic algorithm. We demonstrate its effectiveness across three modalities: pairwise distance constraints, nuclear Overhauser effect restraints, and for the first time chemical shifts. These results establish chemical shift guided structure generation as feasible, expose key weaknesses in current predictors, and showcase a general strategy for incorporating diverse experimental signals. Our work points toward automated, data-conditioned protein modeling beyond the limits of differentiability.
- North America > United States > Michigan (0.04)
- Europe > Austria (0.04)
- Asia > Myanmar > Tanintharyi Region > Dawei (0.04)
- (2 more...)
USF-MAE: Ultrasound Self-Supervised Foundation Model with Masked Autoencoding
Megahed, Youssef, Ducharme, Robin, Erman, Aylin, Walker, Mark, Hawken, Steven, Chan, Adrian D. C.
Ultrasound imaging is one of the most widely used diagnostic modalities, offering real-time, radiation-free assessment across diverse clinical domains. However, interpretation of ultrasound images remains challenging due to high noise levels, operator dependence, and limited field of view, resulting in substantial inter-observer variability. Current Deep Learning approaches are hindered by the scarcity of large labeled datasets and the domain gap between general and sonographic images, which limits the transferability of models pretrained on non-medical data. To address these challenges, we introduce the Ultrasound Self-Supervised Foundation Model with Masked Autoencoding (USF-MAE), the first large-scale self-supervised MAE framework pretrained exclusively on ultrasound data. The model was pre-trained on 370,000 2D and 3D ultrasound images curated from 46 open-source datasets, collectively termed OpenUS-46, spanning over twenty anatomical regions. This curated dataset has been made publicly available to facilitate further research and reproducibility. Using a Vision Transformer encoder-decoder architecture, USF-MAE reconstructs masked image patches, enabling it to learn rich, modality-specific representations directly from unlabeled data. The pretrained encoder was fine-tuned on three public downstream classification benchmarks: BUS-BRA (breast cancer), MMOTU-2D (ovarian tumors), and GIST514-DB (gastrointestinal stromal tumors). Across all tasks, USF-MAE consistently outperformed conventional CNN and ViT baselines, achieving F1-scores of 81.6%, 79.6%, and 82.4%, respectively. Despite not using labels during pretraining, USF-MAE approached the performance of the supervised foundation model UltraSam on breast cancer classification and surpassed it on the other tasks, demonstrating strong cross-anatomical generalization.
- North America > Canada > Ontario > National Capital Region > Ottawa (0.14)
- Europe > Switzerland (0.04)
- North America > United States > Texas > Kleberg County (0.04)
- (8 more...)
- Health & Medicine > Therapeutic Area > Obstetrics/Gynecology (1.00)
- Health & Medicine > Therapeutic Area > Cardiology/Vascular Diseases (1.00)
- Health & Medicine > Diagnostic Medicine > Imaging (1.00)
- Health & Medicine > Therapeutic Area > Oncology > Gastric Cancer (0.34)
IMPACT: A Large-scale Integrated Multimodal Patent Analysis and Creation Dataset for Design Patents
Our dataset includes half a million design patents comprising 3.61 million figures along with captions from patents granted by the United States Patent and Trademark Office (USPTO) over a 16-year period from 2007 to 2022. We incorporate the metadata of each patent application with elaborate captions that are coherent with multiple viewpoints of designs.
- North America > United States > Illinois > Cook County > Chicago (0.04)
- North America > United States > Illinois > Champaign County > Urbana (0.04)
- Asia > Middle East > UAE > Abu Dhabi Emirate > Arabian Gulf (0.04)
Event Driven CBBA with Reduced Communication
Sao, Vinita, Ho, Tu Dac, Bhore, Sujoy, Sujit, P. B.
In various scenarios such as multi-drone surveillance and search-and-rescue operations, deploying multiple robots is essential to accomplish multiple tasks at once. Due to the limited communication range of these vehicles, a decentralised task allocation algorithm is crucial for effective task distribution among robots. The consensus-based bundle algorithm (CBBA) has been promising for multi-robot operation, offering theoretical guarantees. However, CBBA demands continuous communication, leading to potential congestion and packet loss that can hinder performance. In this study, we introduce an event-driven communication mechanism designed to address these communication challenges while maintaining the convergence and performance bounds of CBBA. We demonstrate theoretically that the solution quality matches that of CBBA and validate the approach with Monte-Carlo simulations across varying targets, agents, and bundles. Results indicate that the proposed algorithm (ED-CBBA) can reduce message transmissions by up to 52%.
- Europe > Russia (0.04)
- Europe > Norway > Central Norway > Trøndelag > Trondheim (0.04)
- Asia > Russia (0.04)
- (3 more...)
- Information Technology > Security & Privacy (0.48)
- Telecommunications > Networks (0.34)
When Prompt Engineering Meets Software Engineering: CNL-P as Natural and Robust "APIs'' for Human-AI Interaction
Xing, Zhenchang, Liu, Yang, Cheng, Zhuo, Huang, Qing, Zhao, Dehai, Sun, Daniel, Liu, Chenhua
With the growing capabilities of large language models (LLMs), they are increasingly applied in areas like intelligent customer service, code generation, and knowledge management. Natural language (NL) prompts act as the ``APIs'' for human-LLM interaction. To improve prompt quality, best practices for prompt engineering (PE) have been developed, including writing guidelines and templates. Building on this, we propose Controlled NL for Prompt (CNL-P), which not only incorporates PE best practices but also draws on key principles from software engineering (SE). CNL-P introduces precise grammar structures and strict semantic norms, further eliminating NL's ambiguity, allowing for a declarative but structured and accurate expression of user intent. This helps LLMs better interpret and execute the prompts, leading to more consistent and higher-quality outputs. We also introduce an NL2CNL-P conversion tool based on LLMs, enabling users to write prompts in NL, which are then transformed into CNL-P format, thus lowering the learning curve of CNL-P. In particular, we develop a linting tool that checks CNL-P prompts for syntactic and semantic accuracy, applying static analysis techniques to NL for the first time. Extensive experiments demonstrate that CNL-P enhances the quality of LLM responses through the novel and organic synergy of PE and SE. We believe that CNL-P can bridge the gap between emerging PE and traditional SE, laying the foundation for a new programming paradigm centered around NL.
- Asia > Middle East > UAE > Abu Dhabi Emirate > Arabian Gulf (0.04)
- Asia > China > Jiangxi Province (0.04)
- Health & Medicine (0.67)
- Energy (0.67)
- Education (0.46)
CoordField: Coordination Field for Agentic UAV Task Allocation In Low-altitude Urban Scenarios
Zhang, Tengchao, Tian, Yonglin, Lin, Fei, Huang, Jun, Süli, Patrik P., Ni, Qinghua, Qin, Rui, Wang, Xiao, Wang, Fei-Yue
With the increasing demand for heterogeneous Unmanned Aerial Vehicle (UAV) swarms to perform complex tasks in urban environments, system design now faces major challenges, including efficient semantic understanding, flexible task planning, and the ability to dynamically adjust coordination strategies in response to evolving environmental conditions and continuously changing task requirements. To address the limitations of existing methods, this paper proposes CoordField, a coordination field agent system for coordinating heterogeneous drone swarms in complex urban scenarios. In this system, large language models (LLMs) is responsible for interpreting high-level human instructions and converting them into executable commands for the UAV swarms, such as patrol and target tracking. Subsequently, a Coordination field mechanism is proposed to guide UAV motion and task selection, enabling decentralized and adaptive allocation of emergent tasks. A total of 50 rounds of comparative testing were conducted across different models in a 2D simulation space to evaluate their performance. Experimental results demonstrate that the proposed system achieves superior performance in terms of task coverage, response time, and adaptability to dynamic changes.
- Asia > Macao (0.05)
- Europe > United Kingdom > North Sea > Southern North Sea (0.05)
- Asia > China > Tianjin Province > Tianjin (0.05)
- (3 more...)
- Transportation > Infrastructure & Services (0.94)
- Transportation > Ground > Road (0.68)
Hegseth tears up red tape, orders Pentagon to begin drone surge at Trump's command
National Review editor-in-chief Rich Lowry and FOX Business' Liz Claman join'MediaBuzz' to discuss Hegseth's heated press conference where he called out the media's'hatred' of President Donald Trump. FIRST ON FOX: Defense Secretary Pete Hegseth has issued sweeping new orders to fast-track drone production and deployment, allowing commanders to procure and test them independently and requiring drone combat simulations across every branch of the military. As part of an aggressive push to outpace Russia and China in unmanned warfare, "the Department's bureaucratic gloves are coming off," Hegseth wrote. "Lethality will not be hindered by self-imposed restrictions... Our major risk is risk-avoidance." In a pair of memos first obtained by Fox News Digital, Hegseth rescinded legacy policies that he believes restricted innovation.
- North America > United States (1.00)
- Europe > Russia (0.26)
- Asia > Russia (0.26)
- (8 more...)
- Government > Regional Government > North America Government > United States Government (1.00)
- Government > Military (1.00)
The Efficacy of Semantics-Preserving Transformations in Self-Supervised Learning for Medical Ultrasound
VanBerlo, Blake, Wong, Alexander, Hoey, Jesse, Arntfield, Robert
Data augmentation is a central component of joint embedding self-supervised learning (SSL). Approaches that work for natural images may not always be effective in medical imaging tasks. This study systematically investigated the impact of data augmentation and preprocessing strategies in SSL for lung ultrasound. Three data augmentation pipelines were assessed: (1) a baseline pipeline commonly used across imaging domains, (2) a novel semantic-preserving pipeline designed for ultrasound, and (3) a distilled set of the most effective transformations from both pipelines. Pretrained models were evaluated on multiple classification tasks: B-line detection, pleural effusion detection, and COVID-19 classification. Experiments revealed that semantics-preserving data augmentation resulted in the greatest performance for COVID-19 classification - a diagnostic task requiring global image context. Cropping-based methods yielded the greatest performance on the B-line and pleural effusion object classification tasks, which require strong local pattern recognition. Lastly, semantics-preserving ultrasound image preprocessing resulted in increased downstream performance for multiple tasks. Guidance regarding data augmentation and preprocessing strategies was synthesized for practitioners working with SSL in ultrasound.
- North America > United States > New York > New York County > New York City (0.04)
- North America > Canada > Ontario > Middlesex County > London (0.04)
- Europe > Netherlands > North Holland > Amsterdam (0.04)
- (2 more...)
- Research Report > Experimental Study (1.00)
- Research Report > New Finding (0.68)
- Health & Medicine > Therapeutic Area (1.00)
- Health & Medicine > Diagnostic Medicine > Imaging (1.00)
A Comprehensive Analysis of Adversarial Attacks against Spam Filters
Hotoğlu, Esra, Sen, Sevil, Can, Burcu
Deep learning has revolutionized email filtering, which is critical to protect users from cyber threats such as spam, malware, and phishing. However, the increasing sophistication of adversarial attacks poses a significant challenge to the effectiveness of these filters. This study investigates the impact of adversarial attacks on deep learning-based spam detection systems using real-world datasets. Six prominent deep learning models are evaluated on these datasets, analyzing attacks at the word, character sentence, and AIgenerated paragraph-levels. Novel scoring functions, including spam weights and attention weights, are introduced to improve attack effectiveness. This comprehensive analysis sheds light on the vulnerabilities of spam filters and contributes to efforts to improve their security against evolving adversarial threats. Introduction Deep learning has seen significant advancements in the field of natural language processing (NLP), particularly in tasks such as ...
- North America > United States > Texas > Travis County > Austin (0.04)
- Oceania > Australia > Victoria > Melbourne (0.04)
- North America > United States > New York > New York County > New York City (0.04)
- (5 more...)
- Information Technology > Security & Privacy (1.00)
- Government > Military (1.00)
Assessing data-driven predictions of band gap and electrical conductivity for transparent conducting materials
Ottomano, Federico, Goulermas, John Y., Gusev, Vladimir, Savani, Rahul, Gaultois, Michael W., Manning, Troy D., Lin, Hai, Manzanera, Teresa P., Poole, Emmeline G., Dyer, Matthew S., Claridge, John B., Alaria, Jon, Daniels, Luke M., Varma, Su, Rimmer, David, Sanderson, Kevin, Rosseinsky, Matthew J.
Machine Learning (ML) has offered innovative perspectives for accelerating the discovery of new functional materials, leveraging the increasing availability of material databases. Despite the promising advances, data-driven methods face constraints imposed by the quantity and quality of available data. Moreover, ML is often employed in tandem with simulated datasets originating from density functional theory (DFT), and assessed through in-sample evaluation schemes. This scenario raises questions about the practical utility of ML in uncovering new and significant material classes for industrial applications. Here, we propose a data-driven framework aimed at accelerating the discovery of new transparent conducting materials (TCMs), an important category of semiconductors with a wide range of applications. To mitigate the shortage of available data, we create and validate unique experimental databases, comprising several examples of existing TCMs. We assess state-of-the-art (SOTA) ML models for property prediction from the stoichiometry alone. We propose a bespoke evaluation scheme to provide empirical evidence on the ability of ML to uncover new, previously unseen materials of interest. We test our approach on a list of 55 compositions containing typical elements of known TCMs. Although our study indicates that ML tends to identify new TCMs compositionally similar to those in the training data, we empirically demonstrate that it can highlight material candidates that may have been previously overlooked, offering a systematic approach to identify materials that are likely to display TCMs characteristics.
- Europe > United Kingdom > England > Merseyside > Liverpool (0.04)
- Asia > Middle East > Iran > Tehran Province > Tehran (0.04)
- North America > United States > New York > New York County > New York City (0.04)
- (4 more...)